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tors can be used as a novel anticancer drug, etc. 
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Description 

Technical Field 

s [0001] The present invention relates to a protein factor involved in angiogenesis in humans and falls in the field of 
genetic engineenng. 

Background Art 

10 [0002] The process of angiogenesis. in which endothelial cells existing in the inner wall of blood vessels of animals 

EES?- T . VeSSe,S - fe tri99ered * fransducfo " <* a signal- A variety of substances^ V££ 
^ ^^Snal transducUon. The most notable substance among them is the vascular endothelia. grcXactor 
(VEGF). VEGF is a protein factor which was isolated and purified, and can increase the proliferation of endothelial cells 
andthe P«^«y °» Wood vessels (Senger, D. R. etal.. Science219: 983-985 (1983); Ferrara. N. andHenS W 7 
^ ^T^ S - °r mm - 161 : 851 " 858 < 1989 >'" » has *«> reported that the human VEGF genTcorrtains 
Pr0d " C f fouf 01 121 ' 16S - 189 - ° r 206 ■*» acid residues, depending on the dH- 

wence m splicing, which causes different secretionpatterns (Houck. K. A. et al.. Mo). Endocrinol. 5: 1806-1814 (1991)) 
H has also been reported that there is a VEGF-specific receptor, fit-1 . and lhat the binding of VEGF to flt-1 is important 
for the signal transduction (Vries. C. D. et al., Science 255: 989-991 (1992)) important 

20 2S2 r C ^Tw^ 1 ? C,0r «i PIGF) ** P ,ate,eMerived Orowth factor (PDGF) have thus far been isolated and are 
factors related to VEGF. These factors are found to promote proliferation activities of vascular endothelial cells (Maoli- 

^ vfcAc o^l ACad ^ USA 881 9267 " 9271 0991): Betsholtz. C. et al.. Nature 320: 695-699 (1986)) In 
addrtion VEGF-B (Olofsson, B. et al.. Proc. Natl. Acad. Sci. USA 93: 2576-2581 (1996)) and VEGF-C LeTj S 

» itS. ^ 1988 " 1992 (1996 ^ 0utov ' V - et BO J. 15. 290-299 (1996)) have r^y been 

r 1 ^! 1^1 faCt0rS appear t0 eons,i,ute a fe" 1 " 1 /. and this may contain additional unknown factors 

„^f ?f T SU99eSte f ^ VEGF is invDlved in ^ vascular formati ° n at «" a developmental stage but also 
in the patholog,cal neovascularization associated with diabetes, rheumatoid arthritis, retinopathy, and the growth of 
solid tumors Furthermore. ,n addition to its vascular endothelial cell growth-promoting effects listed above. VEGFs abN- 

SuJesTsTml^ Tr^^ SU99eS,6d '° be inv0,Ved in "s™ forma «°" resul «"9 *«" various 
causes. Also, these VEGF family factors may act on not only the blood vessels but also the blood cells and the lymphatic 

vesse s. They may thus play a role in the differentiation and proliferation of blood cells and the formation of lymphatic 
n^jSniS"^ VEGF <amily <aCtore ^ PreSem ' y draWin9 extraordinar y attention *» develcSrHS 



35 



Disclosure of the Invention 



££2- * ^f" 8 * 816 P ra sent invention is to isolate a novel protein belonging to the VEGF family and a gene 
encoding the protein. We searched for genes having homology to VEGF-C. which is a recently cloned VEGF family 

w^ S l^ Pr «f.. S ! qUenCe Ta9S (KT) and Sequence Ta ^ Si,es < STS > in * ha <*nBank database. As a 
n!^L 1^ J "* aSSUmed to have "^^y t0 the C-terminal portion of VEGF-C. We then designed 
pmners basec i on tte sequence and amplified and isolated the corresponding cDNA using the 5' RACE method and 
the 3 RACE method. The nucleot.de sequence of the isolated cDNA was determined, and the deduced amino acid 
sequence therefrom revealed that the amino acid sequence had significant homology to that of VEoSSaS I on Te 

nat^VFGF m XT* T *" ^ * 3 M «* VE ^ family M S£ 

Tn p ^r T 3,50 succeeded in pressing the protein encoded by the isolated human VEGF-D gene 

ss£ «^ta3sr ss gen F r™ we teve suw:eeded in isoia,in9 - — - 

(D A protein shown by SEQ ID NO.1 or having the amino acid sequence derived therefrom in which one or more 
amino acids are substituted, deleted, or added; 

(2) A protein encoded by a DNA that hybridizes with the DNA shown by SEQ ID NO 2- 
55 (3) A DNA encoding the protein of (1); ' ' 

(4) A DNA hybridizing with the DNA shown by SEQ ID NO. 2; 

(5) A vector containing the DNA of (3) or (4); 

(6) A transformant carrying the vector of (5); 
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(7) A method of producing the protein of (1) or (2). which comprises cufturing the transformed of (6); 

(8) An antibody binding to the protein of (1) or (2); 

(9) A method of screening a compound binding to the protein of (1) or (2), which comprises a step of detecting the 
activity of the protein of (1) or (2) to bind to a test sample; and 

(10) A compound binding to the protein of (1) or (2), wherein said compound has been isolated by the method of 
(9). 

[0008] The protein of the present invention (VEGF-D) has significant homology to VEGF-C and can be considered to 
be a fourth factor of . the VEGF family. Since the major function of VEGF is vascular formation at the developmental 
ffi 8 ^l GF fe COnsidered to be involved m Pathological neovascularization associated with diabetes, rheuma- 
fanrtonr 8, rebnCpathy ' "* 9rWth 01 SOlid tumors - 11,6 P f0tein °" me Pfeserrt invention is thought to have similar 

[0009] A person skilled in the art could prepare functionally equivalent proteins through modifying VEGF-D of the 
present invention by adding, deleting, or substituting one or more of the amino acids of VEGF-D shown by SEQ ID NO 
1 using known methods. Modifications of the protein can also occur naturally in addition to the artificial modifications 
described above. These modified proteins are also included in the present invention. Known methods for adding delet- 
iSo 0r 4^f^T 9 amin0 aCidS include me wer,ap extension Polymerase chain reaction (OE-PCR) method (Gene. 

[001 0] The DNA encoding VEGF-D of the present invention, shown by SEQ ID NO. 2. is useful for isolating DMAs 
encoding the proteins having similar functions to VEGF-D in other organisms. For example, a person skilled in the art 
could routonefy isolate homologs of human VEGF-D of the present invention from other organisms by allowing the DNA 
shown by SEQ ID NO. 2. or part thereof, as a probe, to hybridize with the DNA derived from other organisms. The DNA 
that hybndizes with the DNA shown by SEQ ID NO. 2 is also included in the present invention. The other organisms 
include mice, rats, and rabbits. 

[001 1] The DNA encoding a protein that is functionally equivalent to VEGF-D usually has high homology to the DNA 
shown by SEQ ID NO 2. The high homology used herein means at least 70% or higher, more preferably 80% or higher 
and still more preferably 90% or higher of sequence homology. 

L°°! 2 1 J* eXamp ' e °' the "V 1 '*' 23 * 0 " conditions for isolating the DNA having high homology will be given below 
Prer^ndzation b performed in ExpressHyb Solution at 68°C for 30 minutes. The probe labeled with a radioisotope is 
denatured at 95»C to 1 00»C for 2 to 5 minutes and rapidly chilled on ice. The probe is added to a new ExpressHybSolu- 
n^t, «on to u the J so ! u,ion "nialning the probe and allowed to hybridize under a temperature gradient 

owe to 55 C for 2 hours. The blot is washed four times, for 1 0 minute each, with a 2 x SSC solution containing 0 05% 
SDS at room temperature. The blot is then washed with a 0.1 x SSC solution containing 0.1% SDS at 45°C for 3 min- 
utes. The blot is subjected to autoradiography. 

L°°I5i. ! X3mple °!^ e "y^ 380 " conditions for isolating the DNA having very high homology will be given below 
SSKS 8 ?^ ExpressHyb Solution at 6800 ° r 3" minutes. The probe labeled with a radioisotope is 

denatured at 95'C to 100°C for 2 to 5 minutes and rapidly chilled on ice. The probe is added into a new ExpressHyb 
Solution. The blot is transfenred into the solution containing the probe, and allowed to hybridize at 68°C for 1 hour The 
blot was washed four times, for 10 minute each, with a 2 x SSC solution containing 0.05% SDS at room temperature 
The blot was then washed with a 0.1 X SSC solution containing 0.1% SDS at 50'C for 40 minutes, during which the 
solution was replaced once. The Mot was then subjected to autoradiography. 

[0014] Note that the hybridization condition can vary depending on the length of the probe (whether it is an oligomer 
or a probe with more than several hundred bases), the labeling method (whether the probe is radioisotopically labeled 
or r^radiorsotop«ally labeled), and the type of the target gene to be cloned. A person skilled in the art would properly 
select the suitable hybridization conditions, m the present invention, it is especially desirable that the condition does not 
allow the probe to hybridize with the DNA encoding VEGF-C. 

[001 5] The DNA of the present invention is also used to produce VEGF-D of the present invention as a recombinant 
P ^ 1 „*f iical| y : me recombinant protein can be produced in large quantity by incorporating the DNA encodinq 
VEGF-D (for example, the DNA shown by SEQ ID NO. 2) into a suitable expression vector, introducing the resulting vec- 
tor into a host, and culturing the transformant to allow the recombinant protein to be expressed 
[0016] The vector to be used for producing the recombinant protein is not particularly restricted. However vectors 
S f - aS .^u MEX ' 1 (Prome 9 a > or P EF - B0S < Nuc| eic Acids Res 1990. 18(1 7): p.5322) are preferable.. Suitable exam- 
ples of the host mto which the vector is introduced include E. coli cells. CHO cells, and COS cells 
[0017] The VEGF-D protein expressed by the transformant can be purified by suitably combining purification treat- 
ments such as solubilization with a homogenizer or a sonicator. extraction by various buffers, solubilization or precipi- 
tation by aad or alkali, extraction or precipitation with organic solvents, salting out by ammonium sulfate and other 
agents. diaJysis. ultrafiltration using membrane fDters. gel filtration, ion exchange chromatography, reversed-phase 
chromatography, counter-current distribution chromatography, high-performance liquid chromatography, isoelectric 



3 



EP0935001A1 



focusing, gel electrophoresis, or affinity chromatography in which antibodies or receptors are immobilized 
10018] Once the recombinant protein is obtained, antibodies against it can be prepared using known methods. The 
known methods include preparing polyclonal antibodies by immunizing rabbits, sheep, or other animals with the purified 
protein, and preparing monoclonal antibodies from the antibody-producing cells of immunized mice or rats. These antj- 
5 bodieswnmake .t possWe to quantify VEGF. Although the antibodies thus obtained can be used as they are. ft will be 
™ e ;f<fr l ° «L e »"e humanized antibodies to reduce the immunogenic^. The methods of humanizing the antibod- 

T ^1 f m6ttl0d 3nd ,he method <* direct,y producin 9 a human artibody. In the CDR Graft method 
^ ^ 9ene ! fr ° m ,he monodonal antibody-produdng cells and its antigenic determinant portion is trans- 

Ptentedinto an existing human antibody. In the method of directly producing a human antibody, a mouse whose immune 

v^TrST r ^ C !?^! hum r immune sy8tem fe immunized, similar to ordinary monoclonal antibodies. The 
method * ^ adminis,ered into «he body by subcutaneous injection or a similar 

EL* PefSOn SW " ed ^ C ° Uld SCree " Compounds that 10 *» P ro »«' n °» the present invention by known 

" ISp^ T" 1 ^ SUCh com f ounds <*" 06 ^ned by making a cDNA library on a phage vector (such as Xgt1 1 

SiUSm t0 T thB Pr0,ei " ^ binds to the protein of the present invention (such as lung, 
small intestine, and heart cells of mammals), expressing the cDNAs on LB-agarose. fixing the expressed proteins onto 
a fitter, preparing the, purified protein of the present invention as a biotin-labeled or a fusion protein with the GST protein 
and reacting this protein with the above filter. The desired compounds could then be detected by west western blotting 

Drepps. A. Ullnch. A., and Schlessmger, J. (1991) Cloning of P13 Wnase-associated p85 utilizing a novel method fo; 
express.orVdon.ng of target proteins for receptor tyrosine kinases. Cell 65: 83-90). Another method comprises the fol- 
lowing steps. First, express the protein of the present invention fused with the SRF binding domain or the GAL4 binding 
domain m yea* celte Second, prepare a cDNA library which expresses cDNAs fused with the transcription activation 
™ Z ^ fr ° m ? C6,,S 6X1360,6(1 10 a bind8 to the protein of the preset Sfon 

?n m "^f 6 ^! . ,nt ° ,h6 ab0V6 y6aSt Cells " Fourth - isola te the library-derived cDNA from the positive clones' 
Rnal^ntroduce the isolated cDNA into E. cdi to allow it to be expressed. (When a protein that binds to the protein d 

5fJ ^S^TT ' S r Pr !!^ L 685 * C6,IS ' 1,16 r6port6r 9ene is activa,ed «* the P^tive clone can be detected.) 
» ^^l^'f farm ^ US "S the two-hybrid system (MATCHMAKER Two-Hybrid system. Mammalian MATCH- 
!i , * !^ ^ 0r MAT CHMAKER One-Hybrid System (all by Oontech) orthe HybriZAP Two-Hybrid 
Vector System (Stratagene) (Datton. S. and Treisman. R. (1992) CharacJafan of SAP-1. acrolein reSSby 

S^LTE?* to C " foS S6rum response element> Ce " 68: S 97 " 612 )- Alternatively, the binding proteins can 
b *™?Z by a cDNA «°*ry '«>m the cells expected to express a substance, such as a receptor^ 

» n£!r >,e, » 1 B Pr f 6m inV6nti0n (<or exam P ,e ' vas oolar endothelial cells, bone marrow cells, or lymph duct 

S2J £ UC "2 1 . rt . ,nto 06118 as C °S. detecting the binding of the protein of the present invention byteett or 
S^Vrf TT.^v , ' uorescence ' and proteins that bind to the protein of the present invention 
O^ata. K. Taga. T. Urate. Y.. Yawata. H.. Kawanishi. Y. Seed. a. Taniguchi. T. Hirano. T, and Kishimoto. T. (1988) 
l i? ^TT! hUman ln,e r |euki n-6 (BSF-2/IFN beta2) receptor. Science 241: 825-828. Fukunaga R 
Ishizaka-lkeda, E., Seto, Y. and Nagata. S. (1990) Expression cloning of a receptor tor murine granulocyte Tony stim! 

C l" : r i " 350) - S,i " an0th6r "^^ the culture supernatant or the cellar extract 

22SL7 e( 11 to 6XPreSS 8 Pr ° t6in Wnd8 to ^ Pr0t6in 01 the P resem inventi O" onto an affinity column to 
^TSSS^l ^ PrK !? ' nV ! nt0n haS b66n immobi,i2ed - Pitying the proteins specifically bound to the col- 
^J. ' 6 09 ,h6 Pr0t6 " 1 8131 binds ,0 1,19 protein °» me P resent inve "tion can be obtained by deter- 

s ^Z™ „ Za^k I- me binding Pr ° ,ein ' Syrtnesi2in 0 oligonudeotides based on the sequence, and 

s screening a cDNA library with the oligonucleotides as probes. 

f^^T"? f° mpo . unds mat bind to the P«aein of the present invention can be screened by contacting com- 
2fl ^k 06 ." K ?' \ rand ° m P" 396 Pep *' de display ,ibrary ^ the immobilized protein of the present 
T^»„ th l m ° l6CU,eS bound ,0 8,6 ««•*• Th 680 ^Pounds can also be screened by high through. 

* ^ L S - J ° hnS0 . a °- U Ba,T6,, • R " W ' J0lliffe ' L K - ^ Dower. « J-. Sr4a peptides^ poten 
^ n ?, r 0 " 6 erythroDOie t in . Soienoe (Un»ed States) Jul 26 1996. 273: 458-464. Verdine. G.UThe 

n! ^ 0 i na 1 ,Ure • NatUre (En9 ' and) N0V 7 1996> 3841 11 " 13 - "OS 3 "- J C - Jr - Directed combinatorial 
chemistry. Nature (England) Nov 7 1996. 384: 17-19). 

; Sfvpr 5? n^ir* 0 " 66 ** 96,16 ,h6rapy by introducin9 *• VEGF -° S^e into the body 
l7 a .« ^ ^ r* k!^ : de,,c,ency °r expressing the gene in the body. An anti-sense DNAof the VEGF-D gene 
E " th6 1 * ^ 9ene mereby suppressing the pathological neov^larizata 

S JS^^r* to ^"oo *e VEGF-D gene or its antisense DNA into the body, the ret- 

rovirus method, the liposome method, the cationic liposome method, and the adenovirus method are preferable 
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[0024] In , orderto express these genes in the body, the genes can be incorporated into a suitable vector and intro- 

ItTl^ *?L * me retr0virus method - the "P 050 ™ method - »» ca«onic liposome meS or to aJZEs 
method. Although the vectors to be used are not particularly limited, such vectors aspAdexIcw S pZIPn^reT 

E?L P ^ eS T t L n Xl nti0n 8,50 be applied for dia 3nosing disorders caused by abnormalities of the VE6F-D 

gene for example, by PCR to detect an abnormality of the nucleotide sequence of the VEGF-D gene 

[0026] Furthermore, according to the present invention, the VEGF-D protein or its agonists can be used to heal 

Z^ Pr0m0te °° m T VeSSe ' fo " na,i0n ' 8nd aW ""We* by the hematopoietic stem cS b?t££ £££ 
tage of the ang.ogemc effect of the VEGF-D protein. The anttoodies against the VEGF-D protein or te iSSSi 
be used as the therapeutic agents for pathological neovascularization, lymphatic dyspasia. dyshematoS? weo^ 
mas ans.ng from various causes. The arrti-VEGF-D antibodies can be used for dkonosftx, idtera^m 
abnormal production of VEGF-D by quantifying VEGF-D. °«gnosing diseases resulting from 

Brief Descrip tion ol the Drawing 
[0027] 



Figure 1 shows the relationship among the VEGF-D gene, the EST sequences, and the primers used for cloning 
Figure 2 compares the amino acid sequences of EST (H24828) and VEGF-C 

VEGFfam^S 6 8min0 8Cid SeqUenC6S d6dUCed <r0m VEGF ' D 9enS ^ ,r0m 816 kn0W " 9en6S * 1,19 
SGF.Dlirna^«e ydr0Ph * iCi,y '** * VEGF ' a * P'"™ 0 " * *» deava 9 e *■ * *» 

25 Best Mode for Implementing the Invention 

S^ 1 ^ examp,es il,ustrate the P««* Mention In detail, but are not to be construed to limit the scope 



Example 1. Homology search by TFASTA method 

En th^SSEE CGPNKELDENTCQCVC (SEQ ID NO. 3) was designed based on the consensus sequence 
found m the BR3P (Balbian. nng 3 protein) repeat at the C-terminus of VEGF-C. The entire ESTs and STS sequences 
in the Genbank database (as of 29 February 1996) were then searched by the TFASTA met Kri (Kreon anTEn 
Proc. Natl. Acad. Sc.. USA 85: 2444-2448 (1988)). The searching conditions used are ishoT ££, %Sl !? 
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Table 1 


Sequences 


392,210 


Symbols 


135,585,305 


Word Size 


2 


Gap creation penalty 


12.0 


Gap extension penalty 


4.0 



[0030] As a result, an EST (Accession No. H24828) that is considered to code the consensus seouence was fh,.nrt 

vrer^id^ticaf F < rtt? °'" 16 i^^ S If^'f'r< et ' ^ ^ e ^ s ' 1 '-'*^ erc ' < ^ST Project and ni^oUoTl6^mino^W^sidues 
™ Urthef searcn,n 9 for Ur » Gen{ > by NCBI based on this sequence revealed thatfive registered seances 

fT64149 H24780 H24633. H24828. and T64277 (as of 1 March 1996». including the ab^YsrlJ^SSS to 
be derived from the same gene. T64277 and T64149. as we.l as H24828 and H24780. are the S ? 

sequence and the 3" sequence of the same clones, and the length of the insert in both oi K»W^9 » 

[0031 ] Translating the H24828 sequence into a protein sequence in a frame where homology is found suggested that 

SSc tt^SS^ZSZ aca •T fdues - Comparing 8,15 ■*» acid «*"£. with *. oSSEIS 

,h! n T"° a °* (27%) W6fe ,dentical - Morao « r - the amin ° adds *" ^e important for maintaining 

the protem structure, such as cysteine and proline, were well conserved (Fig. 2). Conserved sequences are s!ZJ! 
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black box. 
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Example 2. cDNA cloning from a library 

[0032] Primers for 5 RACE and 3' RACE (5 1 RACE primer: 5'-AGGGATGGGGAACTTGGAACGCTGAAT-3' (SEQ ID 
« ±!!,^ C ^ mer: S'^C^CCAGCACCCCAAAAACTGC-SXSEQ ID NO. 5)) were designed (Rg. 1 . A dou- 
ble-stranded cDNA was synthesized from human fung-derived polyA* RNA using reverse transcriptase PCR was then 
performed using Marathon-Ready cDNA, Lung (Chlontech). having an adapter cDNA ligated to both ends as a template 
cDNA, and using the above primer and adapter primer (AP-1 primer) as primers. The above adapter cDNA contains the 
regions to which the adapter primers AP-1 and AP-2 hybridize. The PCR was performed in a manner such that the sys- 
tem was exposed to treatment at 94°C for 1 min; five cycles of treatment at 94«C for 30 sec and at 72°C for 4 min- five 
cycles of treatment at 94<>C for 30 sec and at 70°C for 4 min; then 25 cycles of treatment at 94-C for 20 sec and at 68°C 
for 4 mm. (TaKaRa Ex Taq (Takara Shuzo) and the attached buffer were used as Taq polymerase instead of Advantage 
KlenTaq Polymerase Mix.) As a resufl. 1.5kb fragments were amplified at the 5 1 region and 0.9kb fragments at the 3' 
region These fragments were cloned with the pCR-Direct Cloning System (Clontech). CR-TRAP Cloning System (Gen- 
Hunter), and PT7Blue-T vector (Novagen). When the 5'-RACE fragment was cloned into the pCR-Direct vector the 
fragment was amplified again using 5'-CTGGTTCGGCCCAGAACTTGGAACGCTGAATCA-3' (SEQ No 71 and 5- 
CTCGCTCGCCCACTAATACGACTCACTATAGG-3' (SEQ ID NO. 8) as primers. No. 7) and 5 

20 Example 3. Nucleotide sequence analysis 

a°I^] o ABI PR,SM ^ Terminator Cvcle Sequencing Ready Reaction Kit with Amplitaq DNA Polymerase FS and 377 
^^^ Uen Z er (AB,) Wem USed f0f DNA s «1 uencin 9- primers used are the primers in the vectors (5'-AATTAAC- 
CCTCACTAAAGGG-3' (SEQ ID NO. 9), S'-CCAGGGTTTTCCCAGTCACGAC-S'fSEQ ID NO 10)) AP-2 Drimer (5- 
25 ACTCACTATAGGGCTCGAGCGGC-3* (SEQ ID NO 1 1)), and 10 primers in the sequence shown ZovfjJZ ™. ' 
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SQ1 (SEQ ID NO. 12) 
SQ2 (SEQ ID NO. 13) 
SQ3(SEQ ID NO. 14) 
SQ4 (SEQ ID NO. 15) 
SQ5(SEQ ID NO. 16) 
SQ6 (SEQ ID NO. 17) 
SQ7(SEQ ID NO. 18) 
SQ8(SEQIDNO. 19) 
SQ9 (SEQ ID NO. 20) 
SQ10(SEQIDN0.21) 



S-AAGTCTGGAGACCTGCT-S' 

S'-CAGCAGGTCTCCAGACT-S* 

5*-CGCACCCAAGGAATGGA-3* 

5 , -TGACACCTGGCCATTCCA-3' 

5*-CATCAGATGGTAGTTCAT-3' 

5--ATGCTGAGCGAGAGTCCATA-3' 

5'-CACTAGGTTTGCGGCAACTT-3' 

S'-GCTGTTGGCAAGCACTTACA-S' 

5'«GATCCATCCAGATCCCTGAA-3' 

5* -CAGATCAGGGCTGCTTCTA-3' 



[0034] Determining the nucleotide sequence of the 1 .5kb fragment at the S'-side and the 0.9kb fragment at the 3'-side 
revealed thatthe sequence of the overlapping region was identical, confirming that 5'- and 3'-side cDNAs of the desired 
gene were obtained. Determining the entire nucleotide sequence of the cDNA revealed that this novel gene has the full 
length of 2 kb and can code a protein consisting of 354 amino acid residues (SEQ ID NO. 1 and SEQ ID NO 2) Figure 
1 shows the relation between this gene and the EST sequences registered in the Genbank database. Comparing the 

amino acids that are well conserved between 
fam. y proteins are also conserved in this novel gene, and therefore this gene is obviously a new member of the VEGF 
fanijly (Rg. 3). In Fig. 3, HSVEGF indicates human VEGF; HSVEGF-D. HSVEGF-C. and HSVEGF-B indicate human 
VEGF homologues (human VEGF-D. human VEGF-C. and human YEGF-B, respectively); HSPDGF-A indicates 
human PDGF-A; HSPDGF-B indicates human PDGF-B; and HSP1GF2 indicates human PI G^e cTsetS 
sequences are shown in a black box. Since VEGF-D is highly homologous to VEGF-C that was cloned as the Flt4 lig- 
and, it was presumed to be a ligand to a Rt-4-like receptor. 

[0035] Deducing the signal peptide cleavage site (Fig. 4b) by hydrophobicfty plot (Rg. 4a) and the method of von Hei- 
ine (von Heijne. G, Nucleic Acids Res. 14. 4683-4690(198$)). N-terminal 21 amino acid residues may be cleaved as 



6 



10 



EP0935 001 A1 

signal peptides, and they may also undergo additional processing like VEGF-C. 
Example 4. Northern Wot analysis 

[0036] A 1 kb fragment which had been cut out by digestion with EcoRI from the 5'-fragment subcloned into pCR- 
Direct vector, was labeled with[a- 32 P]dCTP and used as a probe. Labeling was performed by random priming using 
Ready-to Go DNA labeling beads (Pharmacia). Hybridization was performed in ExpressHyb Hybridization Solution 
(Clontech) by the usual method using Multiple Tissue Northern (MTN) Blot-Human, Human II. Human Fetal and 
Human Cell lines (Clontech). Significant expression was observed in lung, heart, and intestine. Weak expression was 
observed in skeletal muscle, ovary, colon, and pancreas. The apparent molecular weight of the mRNA was 2 2 kb and 
the cloned fragment seemed to be almost the full length of the gene. 

Example 5. VEGF-D protein expression in E. coli 

15 [00371 Two primers, S'-TCCAGATCTTTTGCGGCAACTTTCTATGACAT-S' (SEQ ID NO. 22) and S'-CAGGTCGACT- 
CAAACAGGCACTAATTCAGGTAC-3' (SEQ ID NO. 23), were synthesized to amplify the region corresponding to the 
89th to 181st amino acid residues of human VEGF cDNA. The thus-obtained DNA fragment was digested with restric- 
tion enzymes Bgllt and Sail, and ligated using ligation kit II (Takara Shuzo Co., Ltd) to plasmid pQE42 ((QIAGEN) 
which had been digested with restriction enzymes BamHI and Sail. The resulting plasmid was introduced into E coli 

20 SG19003[pREP4] (QIAGEN),and a plasmid. which was obtained as designed without any mutation, was selected 
(PQE42-BS3). Plasmid pQE42-BS3 was introduced into E. coli BL21 (Invitorogen) and cultured in 10 ml of L Broth con- 
taining 100 mg/l bicucilline (ampicillin sodium for injection, Meiji Seika Kaisha. Ltd.). 200 ml of fresh L Broth was then 
inoculated with the culture. After incubation at 37*C for 1 .5 hours. IPTG was added to 3 mM, and the culture was further 
incubated at 37°C for 5 hours. After cells were harvested, a protein was purified with a Ni-NTA column following the Dro- 

25 tocol of QIAexpress Typell kit. y F 

Example 6. Expression of DHFR-VEGF-D fusion protein in E. coli 

[0038] The region corresponding to the 89th to 181st amino acid residues of human VEGF cDNA was amplified with 
30 the same primers used in Example 5. The thus-obtained DNA fragment was digested with restriction enzymes Bgll and 
Sail. The fragment was then ligated using ligation kit II (Takara Shuzo Co., Ltd.) to the plasmid pQE40 (QIAGEN) which 
had been digested with restriction enzymes BamHI and Sail. The resulting plasmid was introduced into E coli 
SG19003|pREP4] (QIAGEN), and a plasmid. which was obtained as designed without any mutation, was selected 
(PQE40-BS3). Plasmid pQE40-BS3 was introduced into E. coli BL21 (Invitrogen) and cultured in 10 ml of L Broth con- 
35 taming 100 mg/1 bicucilline (ampicillin sodium for injection, Meiji Seika Kaisha, Ltd.). 200 ml of fresh L Broth was then 
inoculated with the culture. After incubation at 37*C for 1 .5 hours. IPTG was added to 3mM, and the culture was further 
incubated at 37*C for 5 hours. After cells were harvested, a DHFR-VEGF-D fusion protein was purified with a Ni-NTA 
column following the protocol of a QIAexpress Typell kit. 

40 Example 7. Cloning mouse VEGF-D cDNA 



45 



[0039] Two Hybond-N+ (Amersham) filters (20 cm x 22 cm) on which 1.5 x 10 5 pfu of Mouse lung S'-stretch cDNA 
library was transferred were prepared. Gradient hybridization from 68°C to 55°C was performed for 2 hours in 
ExpressHyb Hybridization Solution (Clontech) using as a probe an approximately 50 ng Pvu II fragment of human 
VEGF-D. which had been labeled with a 32 P-dCTP (Amersham) using Ready-To-Go DNA Labeling Beads(-dCTP) 
(Pharmacia). The filters were washed four times in 2 x SSC, 0.05% SDS at room temperature for 10 min. then washed 
in 0.1 x SSC. 0.1% SDS at 45°C for 3 min. The washed filters were exposed overnight at -80°C using HyperFilm MP 
(Amersham) and intensifying paper. Positive clones were subjected to the second screening in the same manner as 
above to isolate a single clone. Isolated lambda DNAs were purified from the plate lysate using a QIAGEN Lambda 
so MAX I Kit (Qiagen). Insert DNAs were cut out with EcoRI and subcloned into pUC1 1 8 EcoRI/BAP (Takara Shuzo Co 
Ltd.). Its nucleotide sequence was then determined with ABI377 sequencer (Perkin Elmer). The cDNA coding the full 
length of mouse VRGF-D was reconstructed with two of the obtained clones that overlapped each other SEQ ID NO 
24 shows the nucleotide sequence of mouse VEGF-D cDNA and the deduced amino acid sequence therefrom. 

55 Example 8. Cloning rat VEGF-D cDNA 

[0040] Two Hybond-N+ (Amersham) filters (20 cm x 22 cm), on which 1 .5 x 1 0 5 pfu of Rat lung 5'-stretch cDNA library 
had been transferred, were prepared. Gradient hybridization from 68°C to 55°C was performed for 2 hours in 
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ExpressH.Fyb Hybridization Solution (Clontech) using as a probe an approximately 1 ug fragment containing 1-782 bp 
of the mouse VEGF-D cDNA which had been labeled with a^P-dCTP (Amersham) using Ready-To-Go DNA Labeling 
Beads(-dCTP) (Pharmacia). The filters were washed four times in 2 x SSC, 0.05% SDS at room temperature for 10 rrrin. 
then washed in 0.1 x SSC, 0.1% SDS at 45°C for 3 min. The washed filters were exposed overnight at -80°C using 

5 HyperRIm MP (Amersham) and intensifying paper. Positive clones were subjected to the second screening in the same 
manner as above to isolate a single clone. The isolated positive clone was excised into pBluescript using E. coli SOLAR 
(Stratagene) and helper phage ExAssist (Stratagene), then the sequence was determined with ABI377 sequencer (Per- 
kin Elmer). The sequence seemed to be the rat VEGF-D cDNA but did not contain the termination codon. 
[0041 ] To obtain the C-terminal cDNA which had not been obtained. PCR was performed using Marathon- Ready rat 

io kidney cDNA (Clontech) as a template and 5* primerGCTGCGAGTGTGTCTGTAAA (SEQ ID NO. 26) and 3* primer 
GGGTAGTGGGCAACAGTGACAGCAA (SEQ ID NO. 27) with 40 cycles of 94°C for 1 5 sec, 55°C for 30 sec, and 72 °C 
for 2 min. After the thus-obtained fragment was subcloned into pGEM-T vector (promega), the nucleotide sequence 
was determined with ABI377 sequencer (PerWn Elmer). The resulting clone contained the C-terminus of rat VEGF-D. 
Based on the results of sequencing the clone obtained by plaque hybridization and the clone obtained by PCR, the full 

75 length of the rat VEGF-D sequence was determined. SEQ ID NO. 25 shows the determined nucleotide sequence and 
the deduced amino acid sequence therefrom. 

Industrial Applicability 

20 [00421 In the present invention, a novel protein (VEGF-D) having significant homology to VEGF-C and its gene have 
been isolated. VEGF-D appears to be involved in the pathological neovascularization associated with diabetes, rheu- 
matoid arthritis, the growth of solid tumors, differentiation and proliferation of blood cells, formation of lymphatic vessels, 
and formation of edema resulting from various causes as well as the normal neovascularization at the developmental 
stage. The gene of the present invention can be used to diagnos disorders caused by abnormalities of the VEGF-D 

25 gene and gene therapy for the VEGF-D deficiency. The VEGF-D protein, which is obtained by expressing the gene of 
the present invention, can be used for healing wounds, promoting collateral vessel formation, and aiding hematopoietic 
stemcell proliferation. The antibodies or inhibitors against the VEGF-D protein can be used for treating angiodysplasia 
and lymphangiodysplasia associated with inflammation, edemas arising from various causes, dyshematopoiesis, and, 
as a novel anticancer agent, for treating pathological neovascularization. The VEGF-D protein and its antibodies can be 

30 useful for diagnosing diseases resulting from abnormal production of VEGF-D. 
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Sequence Listing 

(1) Name or appellation of Applicant: Chugai Research Institute for 
Molecular Medicine, Inc. 

(2) Title of the Invention: Novel VEGF-like Factor 

(3) Reference Number: C1-802PCT 

(4) Application Number: 

(5) Filing date: 

(6) Country where the priority application was filed and the 
, 5 application number of the application: Japan, No. Hei 8-185216 

(7) Priority date: July 15, 1996 

(8) Number of sequences: 27 

20 SEQ ID NO: 1 

SEQUENCE LENGTH: 354 
SEQUENCE TYPE; amino acid 
TOPOLOGYi linear 
MOLECULE TYPE: protein 
ORIGINAL SOURCE: 

ORGANISM: Homo sapiens 
TISSUE TYPE : lung 
SEQUENCE DESCRIPTION: 

Met Tyr Arg Glu Trp Val Val Val Asn Val Phe Met Met Leu Tyr Val 
1 5 10 15 

Gin Leu Val Gin Gly Ser Ser Asn Glu His Gly Pro Val Lys Arg Ser 

20 25 30 

Ser Gin Ser Thr Leu Glu Arg Ser Glu Gin Gin lie Arg Ala Ala Ser 

35 40 45 

Ser Leu Glu Glu Leu Leu Arg lie Thr His Ser Glu Asp Trp Lys Leu 
50 55 60 

45 Trp Arg Cys Arg Leu Arg Leu Lys Ser Phe Thr Ser Met Asp Ser Arg 
65 70 75 80 

Sec Ala Ser His Arg Ser Thr Arg Phe Ala Ala Thr Phe Tyr Asp lie 

85 90 95 

Glu Thr Leu Lys Val lie Asp Glu Glu Trp Gin Arg Thr Gin Cys Ser 
100 105 110 
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Pro Arg Glu Thr Cys Val Glu Val Ala Ser Glu Leu Giy Lys Ser Thr 

115 120 125 

Asn Thr Phe Phe Lys Pro Pro Cys Val Asn Val Phe Arg Cys Gly Gly 

. 130 135 140 

Cys Cys Asn Glu Glu Ser Leu lie Cys Met Asn Thr Ser Thr Ser Tyr 
145 -150 155 160 

lie Ser Lys Gin Leu Phe Glu lie Ser Val Pro Leu Thr Ser Val Pro 

165 170 175 

Glu Leu val Pro Val Lys Val Ala Asn His Thr Gly Cys Lys Cys Leu 

180 185 190 

Pro Thr Ala Pro Arg His Pro Tyr Ser lie lie Arg Arg Ser lie Gin 

195 200 205 

lie Pro Glu Glu Asp Arg Cys Ser His Ser Lys Lys Leu Cys Pro lie 

210 215 220 

Asp Met Leu Trp Asp Ser Asn Lys Cy3 Lys Cys Val Leu Gin Glu Glu 
225 230 235 240 

Asn Pro Leu Ala Gly Thr Glu Asp His Ser His Leu Gin Glu Pro Ala 

245 250 255 

Leu Cys Gly Pro His Met Met Phe Asp Glu Asp Arg Cys Glu Cys Val 

260 265 270 

Cys Lys Thr Pro Cys Pro Lys Asp Leu lie Gin His Pro Ly3 Asn Cys 

275 280 285 

Ser Cys Phe Glu Cys Lys Glu Ser Leu Glu Thr Cys Cys Gin Lys His 

290 295 300 

Lys Leu Phe His Pro Asp Thr Cys Ser Cys Glu Asp Arg Cys Pro Phe 
305 310 315 320 

His Thr Arg Pro Cys Ala Ser Gly Lys Thr Ala Cys Ala Lys His Cys 

325 330 335 

Arg Phe Pro Lys Glu Lys Arg Ala Ala Gin Gly Pro His Ser Arg Lys 
340 345 350 

Asn Pro 



SEQ ID NO: 2 
SEQUENCE LENGTH: 2004 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESSS : double 
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TOPOLOGY: linear 
MOLECULE TYPE i cDNA to mRNA 
ORIGINAL SOURCE: 

ORGANISM: Homo sapiens 
TISSUE TYPE: lung 
FEATURE : 

NAME/KEY: CDS 
LOCATION: 403.. 1464 
IDENTIFICATION METHOD : E 
IS SEQUENCE DESCRIPTION: 

CCAGCTTTCT GTARCTGTAA GCATTGCTGC CCACACCACC TCCTTACAAA GCAACTAGAA 60 
CCTGCGGCAT ACATTGGAGA GATTTTTTTA ATTTTCTGGA CAYGAAGTAA ATT TAG AG TG 120 
CTTTCYAATT TCAGGTACAA GACATGTCCA CCTTCTGATT ATTTTTGGAG AACATTTTGA 180 

20 

TTTTTTTCAT CTCTCTCTCC CCACCCCTAA GATTGTGCAA AAAAAGCGTA CCTTGCCTAA 240 
TTGAAATAAT TTCATTGGAT TTTGATCAGA ACTGATCATT TGGTTTTCTG TGTGAAGTTT 300 
TGAGGTTTCA AACTTTCCTT CTGGAGAATG CCTTTTGAAA CAATTTTCTC TAGCTGCCTG 360 
25 ATGTCAACTG CTTAGTAATC AGTGGATATT GAAATATTCA AA ATG TAC AGA GAG 414 

Met Tyr Arg Glu 

1 

TGG GTA GTG GTG AAT GTT TTC ATG ATG TTG TAC GTC CAG CTG GTG CAG 462 

30 

Trp Val Val Val Asn Val Phe Met Met Leu Tyr Val Gin Leu Val Gin 
5 10 . 15 20 

GGC TCC AGT AAT GAA CAT GGA CCA GTG AAG CGA TCA TCT CAG TCC ACA 510 
35 Gly Ser Ser Asn Glu His Gly Pro Val Lys Arg Ser Ser Gin Ser Thr 

25 30 35 

TTG GAA CGA TCT GAA CAG CAG ATC AGG GCT GCT TCT AGT TTG GAG GAA 558 
Leu Glu Arg Ser Glu Gin Gin He Arg Ala Ala Ser Ser Leu Glu Glu 

40 

40 45 50 

CTA CTT CGA ATT ACT CAC TCT GAG GAC TGG AAG CTG TGG AGA TGC AGG 606 
Leu Leu Arg He Thr His Ser Glu Asp Trp Lys Leu Trp Arg Cys Arg 
45 55 60 65 

CTG AGG CTC AAA AGT TTT ACC AGT ATG GAC TCT CGC TCA GCA TCC CAT 654 
Leu Arg Leu Lys Ser Phe Thr Ser Met Asp Ser Arg Ser Ala Ser His 
70 75 80 

50 

CGG TCC ACT AGG TTT GCG GCA ACT TTC TAT GAC ATT GAA ACA CTA AAA 702 
Arg Ser Thr Arg Phe Ala Ala Thr Phe Tyr As? He Glu Thr Leu Lys 

55 
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85 90 95 100 

GTT ATA GAT GAA GAA TGG CAA AGA ACT CAG TGC AGC CCT AGA GAA ACG 750 
Val lie Asp Glu Glu Trp Gin Arg Thr Gin Cys Ser Pro Arg Glu Thr 

105 110 115 

TGC GTG GAG GTG GCC AGT GAG CTG GGG AAG AGT ACC AAC ACA TTC TTC 798 
Cys Val Glu Val Ala Ser Glu Leu Gly Lys Ser Thr Asn Thr Phe Phe 

120 125 130 

AAG CCC CCT TGT GTG AAC GTG TTC CGA TGT GGT GGC TGT TGC AAT GAA 846 
Lys Pro Pro Cys Val Asn Val Phe Arg Cys Gly Gly Cys Cys Asn Glu 
15 135 140 145 

GAG AGC CTT ATC TGT ATG AAC ACC AGC ACC TCG TAC ATT TCC AAA CAG 094 
Glu Ser Leu lie Cys Met Asn Thr Ser Thr Ser Tyr lie Ser Lys Gin 
150 155 160 

20 

CTC TTT GAG ATA TCA GTG CCT TTG ACA TCA GTA CCT GAA TTA GTG CCT 942 
Leu Phe Glu lie Ser Val Pro Leu Thr Ser Val Pro Glu Leu Val Pro 
165 170 175 180 

25 GTT AAA GTT GCC . AAT CAT ACA GGT TGT AAG TGC TTG CCA ACA GCC CCC 990 

Val Lys Val Ala Asn His Thr Gly Cys Lys Cys Leu Pro Thr Ala Pro 

185 190 195 

CGC CAT CCA TAC TCA ATT ATC AGA AGA TCC ATC CAG ATC CCT GAA GAA 1038 
Arg His Pro Tyr Ser He lie Arg Arg Ser He Gin He Pro Glu Glu 

200 205 210 

GAT CGC TGT TCC CAT TCC AAG AAA CTC TGT CCT ATT GAC ATG CTA TGG 1086 
35 Asp Arg Cys Ser His Ser Lys Lys Leu Cys Pro lie Asp Met Leu Trp 

215 220 225 

GAT AGC AAC AAA TGT AAA TGT GTT TTG CAG GAG GAA AAT CCA CTT GCT 1134 
Asp Ser Asn Lys Cys Lys Cys Val Leu Gin Glu Glu Asn Pro Leu Ala 

40 

230 235 240 

GGA ACA GAA GAC CAC TCT CAT CTC CAG GAA CCA GCT CTC TGT GGG CCA 11B2 
Gly Thr Glu Asp His Ser His Leu Gin Glu Pro Ala Leu Cys Gly Pro 
45 245 250 255 260 

CAC ATG ATG TTT GAC GAA GAT CGT TGC GAG TGT GTC TGT AAA ACA CCA 1230 
His Met Met Phe Asp Giu Asp Arg Cys Glu Cys Val Cys Lys Thr Pro 

265 270 275 

50 

TGT CCC AAA GAT CTA ATC CAG CAC CCC AAA AAC TGC AGT TCC TTT GAG 1278 
Cys Pro Lys Asp Leu He Gin His Pro Lys Asn Cys Ser Cys Phe Glu 

55 
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280 285 290 

TGC AAA GAA AGT CTG GAG ACC TGC TGC CAG AAG CAC AAG CTA TTT CAC 1326 
Cys Lys Glu Ser Leu Glu Thr Cys Cys Gin Lys His Lys Leu Phe His 

295 300 305 

CCA GAC ACC TGC AGC TGT GAG GAC AGA TGC CCC TTT CAT ACC AGA CCA 1374 
10 Pro Asp Thr Cys Ser Cys Glu Asp Arg Cys Pro Phe His Thr Arg Pro 

310 315 320 

TGT GCA AGT GGC AAA ACA GCA TGT GCA AAG CAT TGC CGC TTT CCA AAG 1422 
Cys Ala Ser Gly Lys Thr Ala Cys Ala Lys His Cys Arg Phe Pro Lys 
325 330 335 340 

GAG AAA AGG GCT GCC CAG GGG CCC CAC AGC CGA AAG AAT CCT 1464 
Glu Lys Arg Ala Ala Gin Gly Pro His Ser Arg Lys Asn Pro 
20 345 350 

TGATTCAGCG TTCCAAGTTC CCCATCCCTG TCATTTTTAA CAGCATGCTG CTTTGCCAAG 1524 
TTGCTGTCAC TGTTTTTTTC CCAGGTGTTA AAAAAAAAAT C CAT TTT ACA CAGCACCACA 1584 
GTGAATCCAG ACCAACCTTC CATTCACACC AG CT AAG GAG TCCCTGGTTC ATTGATGGAT 1644 

25 

GTCTTCTAGC TGCAGATGCC TCTGCGCACC AAGGAATGGA GAGGAGGGGA CCCATGTAAT 1704 
CCTTTTGTTT AG TTT TGT TT TTGTTTTTTG GTGAATGAGA AAGGTGTGCT GGTCATGGAA 1764 
TGGCAGGTGT CATATGACTG ATTACTCAGA GCAGATGAGG AAAACTGTAG TCTCTGAGTC 1824 
30 CTTTGCTAAT CGCAACTCTT GTGAATTATT CTGATTCTTT TTT AT GC AGA ATTTGATTCG 1884 

TATGATCAGT ACTGACTTTC TGATTACTGT CCAGCTTATA GTCTTCCAGT TTAATGAACT 1944 
ACCATCTGAT GTTTCATATT TAAGTGTATT TAAAGAAAAT AAACACCATT ATTCAAGTCT 2004 

35 

SEQ ID NO: 3 

SEQUENCE LENGTH: 16 

SEQUENCE TYPE: amino acid 
40 TOPOLOGY: linear 

MOLECULE TYPE: peptide 

ORIGINAL SOURCE: 
45 ORGANISM: Homo sapiens 

TISSUE TYPE: lung 

SEQUENCE DESCRIPTION: 

Cys Gly Pro Asn Lys Glu Leu Asp Glu Asn Thr Cys Gin Cys Val Cys 

SO 

15 10 15 
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SEQ ID NO: 4 
SEQUENCE LENGTH: 27 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other- nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
AGGGATGGGG AACTTGGAAC . GCTGAAT 

SEQ ID NO: 5 
SEQUENCE LENGTH: 27 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
GATCTAATCC AGCACCCCAA AAACTGC 

SEQ ID NO: 6 
SEQUENCE LENGTH: 27 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
CCATCCTAAT ACGACTCACT ATAGCGC 

SEQ ID NO: 7 
SEQUENCE LENGTH: 33 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 

CTGGTTCGGC CCAGAACTTG GAACGCTGAA TCA 
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SEQ ID NO: 8 
SEQUENCE LENGTH: 32 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 

SEQUENCE DESCRIPTION: 

CTCGCTCGCC CACTAATACG ACTCACTATA GG 

SEQ ID NO: 9 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
AATTAACCCT CACTAAAGGG 

SEQ ID NO: 10 
SEQUENCE LENGTH: 22 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
CCAGGGTTTT CCCAGTCACG AC 

SEQ ID NO: 11 
SEQUENCE LENGTH: 23 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
ACTCACTATA GGGCTCGAGC GGC 
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SEQ ID NO: 12 
SEQUENCE LENGTH: 17 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULE TYPE: other ■ nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
AAGTCTGGAG ACCTGCT 

SEQ ID NO: 13 
SEQUENCE LENGTH : 17 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
CAGCAGGTCT CCAGACT 

SEQ ID NO: 14 
SEQUENCE LENGTH: 17 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
CGCACCCAAG GAATGGA 

SEQ ID NO: 15 
SEQUENCE LENGTH: 18 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
TGACACCTGG CCATTCCA 
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S£Q ID NO: 16 
SEQUENCE LENGTH; 18 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other ■ nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
CATCAGATGG TAGTTCAT 

SEQ ID NO: 17 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
ATGCTGAGCG AGAGTCCATA 

SEQ ID NO: 18 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
CACTAGGTTT GCGGCAACTT 

SEQ ID NO: 19 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
GCTGTTGGCA AGCACTTACA 



17 



EP0 935 001 A1 



SEQ ID NO: 20 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other- nucleic acid, synthetic ON A 
SEQUENCE DESCRIPTION: 
GATCCATCCA GATCCCTGAA 

SEQ ID NO: 21 
SEQUENCE LENGTH: 19 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
CAGATCAGGG CTGCTTCTA 

SEQ ID NO: 22 
SEQUENCE LENGTH: 32 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 

SEQUENCE DESCRIPTION: 

TCCAGATCTT TTGCGGCAAC TTTCTATGAC AT 

SEQ ID NO: 23 
SEQUENCE LENGTH: 33 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 

CAGGTCGACT CAAACAGGCA CTAATTCAGG TAC 
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SEQ ID NO: 24 
SEQUENCE LENGTH: 1581 
SEQUENCE TYPE: nucleic acid 
STRANDEDNBSS : double 
TOPOLOGY: linear 
MOLECULE TYPE: cONA to mRNA 
ORIGINAL SOURCE: 

ORGANISM: mouse 
TISSUE TYPE: lung 
IS FEATURE: 

NAME /KEY: CDS 
LOCATION r 96.. 1169 
IDENTIFICATION METHOD: E 

20 

SEQUENCE DESCRIPTION: 

TTCCGGGCTT TGCTGGAGAA TGCCTTTTGC AACACTTTTC AGTAGCTGCC TGGAAACAAC 60 
TGCTTAGTCA TCGGTAGACA TTTAAAATAT TCAAA ATG TAT GGA GAA TGG GGA 113 
25 Met Tyr Gly Glu Trp Gly 

1 5 
ATG GGG AAT ATC CTC ATG ATG TTC CAT GTG TAC TTG GTG CAG GGC TTC 161 
Met Gly Asn He Leu Met Met Phe His Val Tyr Leu Val Gin Gly Phe 

10 15 20 

AGG AGC GAA CAT GGA CCA GTG AAG GAT TTT TCT TTT GAG CGA TCA TCC 209 

« 

Arg Ser Glu His Gly Pro Val Lys Asp Phe Ser Phe Glu Arg Ser Ser 

25 30 35 

CGG TCC ATG TTG GAA CGA TCT GAA CAA CAG ATC CGA GCA GCT TCT AGT 257 
Arg Ser Met Leu Glu Arg Ser Glu Gin Gin He Arg Ala Ala Ser Ser 

40 45 50 

TTG GAG GAG TTG CTG CAA ATC GCG CAC TCT GAG GAC TGG AAG CTG TGG 305 
Leu Glu Glu Leu Leu Gin He Ala His Ser Glu Asp Trp Lys Leu Trp 
55 60 65 70 

4S CGA TGC CGG TTG AAG CTC AAA AGT CTT GCC AGT ATG GAC TCA CGC TCA 353 

Arg Cys Arg Leu Lys Leu Lys Ser Leu Ala Ser Met Asp Ser Arg Ser 

75 80 05 

GCA. TCC CAT CGC TCC ACC AGA TTT GCG GCA ACT TTC TAT GAC ACT GAA 401 
Ala Ser His Arg Ser Thr Arg Phe Ala Ala Thr Phe Tyr Asp Thr Glu 
90 95 100 
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ACA CTA 
Thr Leu 

AGA GAG 
Arg Glu 
120 
ACA TTC 
Thr Phe 
135 

TGC AAC 
Cys Asn 

TCC AAA 
Ser Lys 

TTA GTG 
Leu Val 

ACG GGC 
Thr Gly 
200 
CCA GAA 
Pro Glu 
215 

ATG CTG 
Met Leu 



AAA GTT 
Lys Val 
105 

ACA TGC 
Thr Cys 

TTC AAG 
Phe Lys 

GAA GAG 
Glu Glu 

CAG CTC 
Gin Leu 
170 
CCT GTT 
Pro Val 
185 

CCC CGC 
Pro Arg 



ATA GAT GAA 
He Asp Glu 

GTA GAA GTC 
Val Glu Val 
125 

CCC CCC TGT 
Pro Pro Cys 
140 

GGT GTG ATG 
Gly Val Met 
155 

TTT GAG ATA 
Phe Glu He 

AAA ATT GCC 
Lys He Ala 



GAA TGG CAG AGG 
Glu Trp Gin Arg 
110 

GCC AGT GAG CTG 
Ala Ser Glu Leu 



GAA GAT 
Glu Asp 

TGG GAT 
Trp Asp 



50 



CCA CTG 
Pro Leu 

TGT GGA 
Cys Gly 

AAA GCA 
Lys Ala 
280 



CCT GGG 
Pro Gly 
250 
CCG CAC 
Pro His 
265 

CCA TGT 
Pro Cys 



CAT CCT TAC 
His Pro Tyr 
205 

GAA TGT. CCT 
Glu Cys Pro 
220 

AAC ACC AAA 
Asn Thr Lys 
235 

ACA GAA GAC 
Thr Glu Asp 

ATG ACG TTT 
Met Thr Phe 

CCG GGA GAT 
Pro Gly Asp 
285 



GTA AAT 
Val Asn 

TGT ATG 
Cys Met 

TCA GTG 
Ser Val 
175 
AAC CAT 
Asn His 
190 

TCA ATT 
Ser He 



GTC TTC 
Val Phe 
145 
AAC ACA 
Asn Thr 
160 

CCT CTG 
Pro Leu 

ACG GGT 
Thr Gly 

ATC AGA 
He Arg 



CAT TCC 
His Ser 

TGT AAA 
Cys Lys 

CAC TCT 
His Ser 
255 
GAT GAA 
Asp Glu 
270 

CTC ATT 
Leu He 



AAG AAA 
Lys Lys 
225 
TGT GTT 
Cys Val 
240 

TAC CTC 
Tyr Leu 

GAT CGC 
Asp Arg 

CAG CAC 
Gin His 



ACC CAA TGC 
Thr Gin Cys 
115 

GGG AAG ACA 
Gly Lys Thr 
130 

CGG TGT GGA 
Arg Cys Gly 

AGC ACC TCC 
Ser Thr Ser 

ACA TCA GTG 
Thr Ser Val 
180 

TGT AAG TGC 
Cys Lys Cys 

195 
AGA TCC ATT 
Arg Ser He 
210 

CTC TGT CCT 
Leu Cys Pro 



AGC CCT 
Ser Pro 

ACC AAC 
Thr Asn 

GGC TGC 
Gly Cys 
150 
TAC ATC 
Tyr He 
165 

CCC GAG 
Pro Glu 

TTG CCC 
Leu Pro 

CAG ACC 
Gin Thr 



449 



TTG CAA GAC 
Leu Gin Asp 

CAG GAA CCC 
Gin Glu Pro 
260 

TGT GAG TGC 
Cys Glu Cys 

275 
CCG GAA AAC 
Pro Glu Asn 
290 



ATT GAC 
He Asp 
230 
GAG ACT 
Glu Thr 
245 

ACT CTC 
Thr Leu 

GTC TGT 
Val Cys 

TCC AGT 
Cys Ser 
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737 



785 



833 



881 
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977 
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10 



TGC TTT GAG TGC AAA GAA AGT CTG GAG AGC TGC TGC CAA AAG CAC AAG 1025 
Cys Phe Glu Cys Lys Glu Ser Leu Glu Ser Cys Cys Gin Lys His Lys 
295 300 305 310 

ATT TTT CAC CCA GAC ACC TGC AGC TGT GAG GAC AGA TGT CCT TTT CAC 1073 
lie Phe His Pro Asp Thr Cys Ser Cys Glu Asp Arg Cys Pro Phe His 

315 * 320 325 

ACC AGA ACA TGT GCA AGT AGA AAG CCA GCC TGT GGA AAG CAC TGG CGC 1121 
Thr Arg Thr Cys Ala Ser Arg Lys Pro Ala Cys Gly Lys His Trp Arg 
330 335 340 

15 TTT CCA AAG GAG ACA AGG GCC CAG GGA CTC TAC AGC CAG GAG AAC CCT 1169 

Phe Pro Lys Glu Thr Arg Ala Gin Gly Leu Tyr Ser Gin Glu Asn Pro 

345 350 355 

TGATTCAACT TCCTTTCAAG TCCCCCCATC TCTGTCATTT TAAACAGCTC ACTGCTTTGT 1229 

20 

CAAGTTGCTG TCACTGTTGC CCACTACCCC TGCCCCCCCC CCCCCCCGCC TCCAGGTGTT 1289 
AGAAAAGTTG ATTTGACCTA GTGTCATGGT AAAGCCACAT TTCCATGCAA TGGCGGCTAG 1349 
CTGATTCCCC AGTTCACTGA CAAATGACTT GTAGCTTCAA ATGTCTTTGC GCCATCAKCA 1409 
25 CTCAAAAAGG AAGGGGTCTG AAGAACCCCT TGTTTGATAA ATAAAAACAG GTGCCTGAAA 1469 

CAAAATATTA GGTGCCACTC GATTGGGTCC CTCGGGCTGG CCAAATTCCA AGGGCAATGC 1529 
TCCTGAATTT ATTGTGCCCC TTCCTTAATG CGGAATTTCC TTTTGTTTGA TT 1581 

30 

SEQ ID NO: 25 
SEQUENCE LENGTH: 1491 . 
SEQUENCE TYPE: nucleic acid 
35 STRANDEDNESS t double 

TOPOLOGY; linear 
MOLECULE TYPE i CDNA to tnRNA 
ORIGINAL SOURCE : 

ORGANISM: rat 

TISSUE TYPE: lung 
FEATURE : 
45 NAME /KEY: COS 

LOCATION: 270.. 1247 

IDENTIFICATION METHOD: E 
SEQUENCE DESCRIPTION: 

» 

GCCACCTCTT GATTATTTGT GCAGCGGGAA ACTTTGAAAT AGTTTTCATC TCTTTCTCCC 60 
AT ACT AAG AT TGTGTGTGGC CGTGGGGGAG TCCTTGACTA ACTCAAGTCA TTTCATTGGA 120 
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TTTTGATTAC AACTGATCAT GTGATATTTT TTTCCATGTA AAGTTTTGGG GCTTCAAACT 180 
TTGCTTCTGG AGAATGCCTT TTGCAACACT TTTCAGTAGC TGCCTGGAAA CAACTGCTTA 240 
5 GCCATCAGTG GACATTTGAA ATATTCAAA ATG TAT GGA GAG TGG GCC GCA GTG 293 

Met Tyr Gly Glu Trp Ala Ala Val 

1 5 

AAT ATT CTC ATG ATG TCC TAT GTG TAC CTG GTG CAG GGC TTC AGT ATT 341 
Asn lie Leu Met Met Ser Tyr Val Tyr Leu Val Gin Gly Phe Ser lie 

10 15 20 

GAA CAC CGA GCA GTG AAG GAT GTT TCT CTT GAG CGA TCA TCC CGG TCT 389 
Glu His Arg Ala Val Lys Asp Val Ser Leu Glu Arg Ser Ser Arg Ser 
25 30 35 40 

GTG TTG GAA CGT TCT GAA CAA CAG ATC CGC GCG GCT TCT ACT TTG GAA 437 
Val Leu Glu Arg Ser Glu Gin Gin He Arg Ala Ala Ser Thr Leu Glu 

45 50 55 

GAG TTG CTG CAA GTC GCA CAC TCT GAG GAC TGG AAG CTG TGG CGG TGC 485 
Glu Leu Leu Gin Val Ala His Ser Glu Asp Trp Lys Leu Trp Arg Cys 
25 SO 65 70 

CGG TTG AAG CTT AAA AGT CTT GCC AAT GTG GAC TCG CGC TCA ACA TCC 533 
Arg Leu Lys Leu Lys Ser Leu Ala Asn Val Asp Ser Arg Ser Thr Ser 

75 80 85 

CAT CGC TCC ACC AGA TTT GCG GCA ACT TTC TAT GAT ACT GAA ACA CTA 581 
His Arg Ser Thr Arg Phe Ala Ala Thr Phe Tyr Asp Thr Glu Thr Leu 

90 95 100 

AAA GTT ATA GAT GAA GAA TGG CAG AGG ACC CAA TGC AGC CCT AGA GAG 629 
Lys Val He Asp Glu Glu Trp Gin Arg Thr Gin Cys Ser Pro Arg Glu 
105 HO H5 120 

ACA TGC GTA GAA GTC GCC AGT GAG CTG GGG AAG ACA ACC AAC ACA TTT 677 
Thr Cys Val Glu Val Ala Ser Glu Leu Gly Lys Thr Thr Asn Thr Phe 

125 130 135 

TTC AAG CCC CCT TGT GTA AAT GTC TTC CGG TGT GGA GGA TCC TGC AAT 725 
45 Phe Lys Pro Pro Cys Val Asn Val Phe Arg Cys Gly Gly Cys Cys Asn 

140 145 150 

GAA GAG AGC GTG ATG TGT ATG AAC ACA AGC ACC TCC TAC ATC TCC AAA 773 
Glu Glu Ser Val Met Cys Met Asn Thr Ser Thr Ser Tyr He Ser Lys 

155 160 165 

CAG CTC TTT GAG ATA TCA GTG CCT CTG ACA TCA GTG CCC GAG TTA GTG 821 
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Gin Leu Phe Glu lie Ser Val Pro Leu Thr Ser Val Pro Glu Leu Val 

170 175 180 

CCT GTT AAA ATT GCC AAC CAT ACG GGT TGT AAG TGT TTG CCC ACG GGC 869 
Pro Val Lys He Ala Asn His Thr Gly Cys Lys Cys Leu Pro Thr Gly 
185 190 195 200 

CCC CGG CAT CCT TAT TCA ATT ATC AGA AGA TCC ATT CAG ATC CCA GAA 917 
Pro Arg His Pro Tyr Ser He He Arg Arg Ser He Gin lie Pro Glu 

205 210 215 

GAA GAT CAA TGT CCT CAT TCC AAG AAA CTC TGT CCT GTT GAC ATG CTG 965 
Glu Asp Gin Cys Pro His Ser Lys Lys Leu Cys Pro Val Asp Met Leu 

220 225 230 

TGG GAT AAC ACC AAA TGT AAA TGT GTT TTA CAA GAT GAG AAT CCA CTG 1013 
20 Trp Asp Asn Thr Lys Cys Lys Cys Val Leu Gin Asp Glu Asn Pro Leu 

235 240 245 

CCT GGG ACA GAA GAC CAC TCT TAC CTC CAG GAA CCC GCT CTC TGT GGA 1061 
Pro Gly Thr Glu Asp His Ser Tyr Leu Gin Glu Pro Ala Leu Cys Gly 

25 

250 255 260 

CCA CAC ATG ATG TTT GAT GAA GAT CGC TGC GAG TGT GTC TGT AAA GCA 1109 
Pro His Met Met Phe Asp Glu Asp Arg Cys Glu Cys Val Cys Lys Ala 
30 265 270 275 280 

CCA TGT CCT GGA GAT CTC ATT CAG CAC CCG GAA AAC TGC AGT TGC TTT 1157 
Pro Cys Pro Gly Asp Leu. He Gin His Pro Glu Asn Cys Ser Cys Phe 

285 290 295 

GAA TGC AAA GAA AGT CTG GAA AGC TGT TGC CAA AAG CAC AAG ATG TTT 1205 
Glu Cys Lys Glu Ser Leu Glu Ser Cy3 Cys Gin Lys His Lys Met Phe 

300 305 310 

CAC CCT GAC ACC TGC AGA TCA ATG GTC TTT TCA CTG TCC CCT 1247 
His Pro Asp Thr Cys Arg Ser Met Val Phe Ser Leu Ser Pro 
315 320 325 

45 TAATTTGGTT TACTGGTGAC ATTTAAAGGA CATACTAACC TGATTTATTG GGGCTCTTTT 1307 

CTCTCAGGGC CCAAGCACAC TCTTAAAGGA ACACAGACGT TTGGCCTCTA AGAAATACAT 1367 
GGAAGTATTA TAGAGTGATG ATTAAATTGT CTTCTTGTTT CAAACAGGGT CTC ATG ATT A 1427 
CAGACCCGTA TTGCCATGCC TGCCGTCATG C TAT C ATG AG CGG AAAAGAA TCACTGGCAT 1487 
TTAA 1491 
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SEQ ID NO: 26 

SEQUENCE LENGTH: 20 
5 SEQUENCE TYPE: nucleic acid 

STRANOEDNESS : single 

TOPOLOGY: linear 
w MOLECULE TYPE: other nucleic acid, synthetic DNA 

SEQUENCE DESCRIPTION: 

GCTGCGAGTG TGTCTGTAAA 20 

15 

SEQ ID NO: 27 
SEQUENCE LENGTH: 25 
SEQUENCE TYPE: nucleic acid 
20 STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA. 
SEQUENCE DESCRIPTION: 

25 

GGGTAGTGGG CAACAGTGAC AGCAA 25 



30 

Claims 

1. A protein shown by SEQ ID NO: 1 or having the amino acid sequence derived therefrom in which one or more 
35 amino acids are substituted, deleted, or added. 

2. A protein encoded by a DNA hybridizing with the DNA shown by SEQ ID NO: 2. 

3. A DNA encoding the protein of Claim 1 . 

40 

4. A DNA hybridizing with the DNA shown by SEQ ID NO: 2. 

5. A vector containing the DNA of Claim 3 or 4. 
45 6. A transformant carrying the vector of Claim 5. 

7. A method of producing the protein of Claim 1 or 2, which comprises culturing the transformant of Claim 6. 

8. An antibody binding to the protein of Claim 1 or 2. 

so 

9. A method of screening a compound binding to the protein of Claim 1 or 2, which comprises a step of detecting the 
activity of the protein of Claim 1 a 2 to bind to a test sample. 

10. A compound binding to the protein of Claim 1 or 2, wherein the compound have been isolated by the method of 
55 Claim 9. . 
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Fig. 1 
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Fig. 2 
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Fig. 3 
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Fig. 4 
a) Hydrophobic ity 
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b) Prediction of the human VEGF-D signal peptide 
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